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NOVEL HUMAN TRANSPORTER PROTEINS AND 
POLYNUCLEOTIDES ENCODING THE SAME 

The present application claims the benefit of U.S. 
5 Provisional Application Numbers 60/275,009 and 60/284,152, 
which were filed on March 12, 2001 and April 17, 2001, 
respectively. These provisional applications are herein 
incorporated by reference in their entirety. 

1 . INTRODUCTION 
10 The present invention relates to the discovery, 

identification, and characterization of novel human 
polynucleotides encoding proteins that share sequence 
similarity with mammalian transporter proteins. The invention 
encompasses the described polynucleotides, host cell expression 
15 systems, the encoded proteins, fusion proteins, polypeptides 
and peptides, antibodies to the encoded proteins and peptides, 
and genetically engineered animals that either lack or 
overexpress the disclosed polynucleotides, antagonists and 
agonists of the proteins, and other compounds that modulate the 
20 expression or activity of the proteins encoded by the disclosed 
polynucleotides, which can be used for diagnosis, drug 
screening, clinical trial monitoring, the treatment of diseases 
and disorders, and cosmetic or nutriceutical applications. 

2 . BACKGROUND OF THE INVENTION 
25 Transporter proteins are integral membrane proteins that 

mediate or facilitate the passage of materials across the lipid 
bilayer. Given that the transport of materials across the 
membrane can play an important physiological role, transporter 
proteins are good drug targets. Additionally, one of the 
30 mechanisms of drug resistance involves diseased cells using 
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cellular transporter systems to export chemotherapeutic agents 
from the cell. Such mechanisms are particularly relevant to 
cells manifesting resistance to a multiplicity of drugs. 

3 . SUMMARY OF THE INVENTION 

. 5 The present invention relates to the discovery, 

identification, and characterization of nucleotides that encode 
novel human proteins, and the corresponding amino acid 
sequences of these proteins. The novel human proteins (NHPs) , 
described for the first time herein, share structural 

10 similarity with mammalian sodium/bile co- transporters . 

The novel human nucleic acid sequences described herein, 
encode alternative proteins/open reading frames (ORFs) of 377 
amino acids in length (sodium/bile-like transporter, SEQ ID 
NO:2), and 438 amino acids in length (sodium/bile-like 

15 transporter, SEQ ID NO: 5). 

The invention also encompasses agonists and antagonists of 
the described NHPs, including small molecules, large molecules, 
mutant NHPs, or portions thereof, that compete with native 
NHPs, peptides, and antibodies, as well as nucleotide sequences 

2 0 that can be used to inhibit the expression of the described 

NHPs (e.g., antisense and ribozyme molecules, and open reading 
frame or regulatory sequence replacement constructs) or to 
enhance the expression of the described NHPs (e.g., expression 
constructs that place the described polynucleotide under the 

2 5 control of a strong promoter system) , and transgenic animals 

that express a NHP sequence, or "knock-outs" (which can be 
conditional) that do not express a functional NHP. Knock-out 
mice can be produced in several ways, one of which involves the 
use of mouse embryonic stem cell ("ES cell") lines that contain 

3 0 gene trap mutations in a murine homolog of at least one of the 

described NHPs . When the unique NHP sequences described in SEQ 
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ID N0S:l-6 are w knocked-out" they provide a method of 
identifying phenotypic expression of the particular gene, as 
well as a method of assigning function to previously unknown 
genes. In addition, animals in which the unique NHP sequences 
5 described in SEQ ID NOS:l-6 are "knocked-out" provide a unique 
source in which to elicit antibodies to homologous and 
orthologous proteins, which would have been previously viewed 
by the immune system as "self" and therefore would have failed 
to elicit significant antibody responses. 

10 Additionally, the unique NHP sequences described in SEQ ID 

NOS:l-6 are useful for the identification of protein coding 
sequences, and mapping a unique gene to a particular 
chromosome. These sequences identify biologically verified 
exon splice junctions, as opposed to splice junctions that may 

15 have been bioinf ormatically predicted from genomic sequence 

alone. The sequences of the present invention are also useful 
as additional DNA markers for restriction fragment length 
polymorphism (RFLP) analysis, and in forensic biology. 

Further, the present invention also relates to processes 

20 for identifying compounds that modulate, i.e., act as agonists 
or antagonists of, NHP expression and/or NHP activity that 
utilize purified preparations of the described NHPs and/or NHP 
products, or cells expressing the same. Such compounds can be 
used as therapeutic agents for the treatment of any of a wide 

25 variety of symptoms associated with biological disorders or 
imbalances. 

4. DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The Sequence Listing provides the sequences of the 
3 0 described NHP ORFs that encode the described NHP amino acid 
sequences. SEQ ID NOS : 3 and 6 describe nucleotides encoding 
NHP ORFs along with regions of flanking sequence. 
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5 . DETAILED DESCRIPTION OF THE INVENTION 
The NHPs described for the first time herein are novel 
proteins that may be expressed in, inter alia, human cell 
5 lines, brain, pituitary, spinal cord, thymus, spleen, lymph 

node, trachea, lung, kidney, prostate, testis, thyroid, adrenal 
gland, pancreas, salivary gland, stomach, small intestine, 
colon, skeletal muscle, heart, uterus, placenta, mammary gland, 
adipose, skin, esophagus, bladder, cervix, pericardium, fetal 

10 kidney, fetal lung, gall bladder, tongue, aorta, and 6-week old 
embryos (SEQ ID NOS:l-3), or, inter alia, in human cell lines, 
fetal brain, brain, cerebellum, spinal cord, thymus, spleen, 
lymph node, bone marrow, trachea, kidney, fetal liver, liver, 
prostate, testis, thyroid, adrenal gland, pancreas, salivary 

15 gland, stomach, small intestine, colon, skeletal muscle, 

uterus, placenta, mammary gland, esophagus, bladder, cervix, 
rectum, pericardium, hypothalamus, fetal kidney, fetal lung, 
gall bladder, tongue, aorta, 6-, 9-, and 12-week old embryos, 
adenocarcinoma, osteosarcoma, and embryonic carcinoma cells 

20 (SEQ ID NOS:4-6) . 

The present invention encompasses the nucleotides 
presented in the Sequence Listing, host cells expressing such 
nucleotides, the expression products of such nucleotides, and: 
(a) nucleotides that encode mammalian homologs of the described 

2 5 polynucleotides, including the specifically described NHPs, and 
the NHP products; (b) nucleotides that encode one or more 
portions of the NHPs that correspond to functional domains, and 
the polypeptide products specified by such nucleotide 
sequences, including, but not limited to, the novel regions of 

30 any active domain(s); (c) isolated nucleotides that encode 
mutant versions, engineered or naturally occurring, of the 
described NHPs in which all or a part of at least one domain is 



4 



WO 02/072774 PCT/US02/07438 

deleted or altered, and the polypeptide products specified by 
such nucleotide sequences, including, but not limited to, 
soluble proteins and peptides in which all or a portion of the 
signal (or at least one hydrophobic transmembrane) sequence is 
5 deleted; (d) nucleotides that encode chimeric fusion proteins 
containing all or a portion of a coding region of a NHP, or one 
of its domains (e.g., a receptor or ligand binding domain, 
accessory protein/self -association domain, etc. ) fused to 
another peptide or polypeptide; or (e) therapeutic or 

10 diagnostic derivatives of the described polynucleotides, such 
as oligonucleotides, antisense polynucleotides, ribozymes, 
dsRNA, or gene therapy constructs, comprising a sequence first 
disclosed in the Sequence Listing. 

As discussed above, the present invention includes the 

15 human DNA sequences presented in the Sequence Listing (and 

vectors comprising the same) , and additionally contemplates any 
nucleotide sequence encoding a contiguous NHP open reading 
frame (ORF) that hybridizes to a complement of a DNA sequence 
presented in the Sequence Listing under highly stringent 

20 conditions, e.g. , hybridization to filter-bound DNA in 0.5 M 

NaHP0 4 , 7% sodium dodecyl sulfate (SDS) , 1 mM EDTA at 65 °C, and 
washing in 0 . lx SSC/0.1% SDS at 68°C (Ausubel et al . , eds . , 
1989, Current Protocols in Molecular Biology, Vol. I, Green 
Publishing Associates, Inc., and John Wiley & Sons, Inc., N.Y., 

25 at p. 2.10.3) and encodes a functionally equivalent expression 
product. Additionally contemplated are any nucleotide 
sequences that hybridize to the complement of a DNA sequence 
that encodes and expresses an amino acid sequence presented in 
the Sequence Listing under moderately stringent conditions, 

30 e.g., washing in 0 . 2x SSC/0.1% SDS at 42°C (Ausubel et al . , 

1989, supra), yet still encodes a functionally equivalent NHP 
product. Functional equivalents of a NHP include naturally 
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occurring NHPs present in other species, and mutant NHPs, 
whether naturally occurring or engineered (by site directed 
mutagenesis, gene shuffling, directed evolution as described 
in, for example, U.S. Patent No. 5,837,458). The invention 
5 also includes degenerate nucleic acid variants of the disclosed 
NHP polynucleotide sequences. 

Additionally contemplated are polynucleotides encoding NHP 
ORFs, or their functional equivalents, encoded by 
polynucleotide sequences that are about 99, 95, 90, or about 85 

10 percent similar or identical to corresponding regions of the 
nucleotide sequences of the Sequence Listing (as measured by 
BLAST sequence comparison analysis using, for example, the GCG 
sequence analysis package, as described herein, using standard 
default settings) . 

15 The invention also includes nucleic acid molecules, 

preferably DNA molecules, that hybridize to, and are therefore 
the complements of, the described NHP nucleotide sequences. 
Such hybridization conditions may be highly stringent or less 
highly stringent, as described herein. In instances where the 

20 nucleic acid molecules are deoxyoligonucleotides ("DNA 

oligos" ), such molecules are generally about 16 to about 100 
bases long, or about 20 to about 59 or 80 bases long, or about 
34 to about 45 or 47 bases long, or any variation or 
combination of sizes represented therein that incorporate a 

25 contiguous region of sequence first disclosed in the Sequence 

Listing. Such oligonucleotides can be used in conjunction with 
the polymerase chain reaction (PCR) to screen libraries, 
isolate clones, and prepare cloning and sequencing templates, 
etc. 

30 Alternatively, such NHP oligonucleotides can be used as 

hybridization probes for screening libraries, and assessing 
gene expression patterns (particularly using a microarray or 
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high- throughput "chip" format) . Additionally, a series of NHP 
oligonucleotide sequences, or the complements thereof, can be 
used to represent all or a portion of the described NHP 
sequences. An oligonucleotide or polynucleotide sequence first 
5 disclosed in at least a portion of one or more of the sequences 
of SEQ ID NOS:l-6 can be used as a hybridization probe in 
conjunction with a solid support matrix/ substrate (resins, 
beads, membranes, plastics, polymers, metal or metallized 
substrates, crystalline or polycrystalline substrates, etc.). 

10 Of particular note are spatially addressable arrays (i.e., gene 
chips, microtiter plates, etc.) of oligonucleotides and 
polynucleotides, or corresponding oligopeptides and 
polypeptides, wherein at least one of the biopolymers present 
on the spatially addressable array comprises an oligonucleotide 

15 or polynucleotide sequence first disclosed in at least one of 
the sequences of SEQ ID NOS:l-6, or an amino acid sequence 
encoded thereby. Methods for attaching biopolymers to, or 
synthesizing biopolymers on, solid support matrices, and 
conducting binding studies thereon, are disclosed in, inter 

20 alia, U.S. Patent Nos . 5,700,637, 5,556,752, 5,744,305, 

4,631,211, 5,445,934, 5,252,743, 4,713,326, 5,424,186, and 
4,689,405, the disclosures of which are herein incorporated by 
reference in their entirety. 

Addressable arrays comprising sequences first disclosed in 

25 SEQ ID NOS: 1-6 can be used to identify and characterize the 
temporal and tissue specific expression of a gene. These 
addressable arrays incorporate oligonucleotide sequences of 
sufficient length to confer the required specificity, yet be 
within the limitations of the production technology. The 

3 0 length of these probes is usually within a range of between 
about 8 to about 2000 nucleotides. Preferably the probes 
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consist of 60 nucleotides, and more preferably 25 nucleotides, 
from the sequences first disclosed in SEQ ID NOS:l-6. 

For example, a series of oligonucleotide sequences, or the 
complements thereof, can be used in chip format to represent 
5 all or a portion of the described sequences. The 

oligonucleotides, typically between about 16 to about 40 (or 
any whole number within the stated range) nucleotides in 
length, can partially overlap each other, and/or the sequence 
may be represented using oligonucleotides that do not overlap. 

10 Accordingly, the described polynucleotide sequences shall 
typically comprise at least about two or three distinct 
oligonucleotide sequences of at least about 8 nucleotides in 
length that are each first disclosed in the described Sequence 
Listing. Such oligonucleotide sequences can begin at any 

15 nucleotide present within a sequence in the Sequence Listing, 
and proceed in either a sense (5'-to-3') orientation vis-a-vis 
the described sequence or in an antisense orientation. 

Microarray-based analysis allows the discovery of broad 
patterns of genetic activity, providing new understanding of 

2 0 gene functions, and generating novel and unexpected insight 

into transcriptional processes and biological mechanisms. The 
use of addressable arrays comprising sequences first disclosed 
in SEQ ID NOS:l-6 provides detailed information about 
transcriptional changes involved in a specific pathway, 

2 5 potentially leading to the identification of novel components, 

or gene functions that manifest themselves as novel phenotypes . 

Probes consisting of sequences first disclosed in SEQ ID 
NOS:l-6 can also be used in the identification, selection, and 
validation of novel molecular targets for drug discovery. The 

3 0 use of these unique sequences permits the direct confirmation 

of drug targets, and recognition of drug dependent changes in 
gene expression that are modulated through pathways distinct 
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from the intended target of the drug. These unique sequences 
therefore also have utility in defining and monitoring both 
drug action and toxicity. 

As an example of utility, the sequences first disclosed in 
5 SEQ ID N0S:l-6 can be utilized in microarrays, or other assay 
formats, to screen collections of genetic material from 
patients who have a particular medical condition. These 
investigations can also be carried out using the sequences 
first disclosed in SEQ ID NOS:l-6 in silico, and by comparing 

10 previously collected genetic databases and the disclosed 

sequences using computer software known to those in the art. 

Thus the sequences first disclosed in SEQ ID N0S:l-6 can 
be used to identify mutations associated with a particular 
disease, and also in diagnostic or prognostic assays. 

15 Although the presently described sequences have been 

specifically described using nucleotide sequence, it should be 
appreciated that each of the sequences can uniquely be 
described using any of a wide variety of additional structural 
attributes, or combinations thereof. For example, a given 

2 0 sequence can be described by the net composition of the 

nucleotides present within a given region of the sequence, in 
conjunction with the presence of one or more specific 
oligonucleotide sequence (s) first disclosed in SEQ ID NOS:l-6. 
Alternatively, a restriction map specifying the relative 

25 positions of restriction endonuclease digestion sites, or 
various palindromic or other specific oligonucleotide 
sequences, can be used to structurally describe a given 
sequence. Such restriction maps, which are typically generated 
by widely available computer programs {e.g., the University of 

30 Wisconsin GCG sequence analysis package, SEQUENCHER 3.0, Gene 
Codes Corp., Ann Arbor, MI, etc.), can optionally be used in 
conjunction with one or more discrete nucleotide sequence (s) 

9 
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present in the sequence that can be described by the relative 
position of the sequence relative to one or more additional 
sequence (s) or one or more restriction sites present in the 
disclosed sequence . 
5 For oligonucleotide probes, highly stringent conditions 

may refer, e.g. , to washing in 6x SSC/0.05% sodium 
pyrophosphate at 37°C (for 14-base oligos), 48°C (for 17-base 
oligos) , 55°C (for 20-base oligos), and 60°C (for 23-base 
oligos) . These nucleic acid molecules may encode or act as NHP 

10 antisense molecules, useful, for example, in NHP gene 

regulation and/or as antisense primers in amplification 
reactions of NHP nucleic acid sequences. With respect to NHP 
gene regulation, such techniques can be used to regulate 
biological functions. Further, such sequences may be used as 

15 part of ribozyme and/or triple helix sequences that are also 
useful for NHP gene regulation. 

Inhibitory antisense or double stranded oligonucleotides 
can additionally comprise at least one modified base moiety 
that is selected from the group including, but not limited to, 

2 0 5-f luorouracil, 5-bromouracil , 5-chlorouracil , 5-iodouracil , 
hypoxan thine, xanthine, 4 -ace ty Icy to sine , 

5- (carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil , dihydrouracil , 
beta-D-galactosylqueosine, inosine, N6 - i sopenteny ladenine , 

2 5 1-methylguanine, 1-methylinosine , 2, 2 -dimethylguanine , 

2 -me thy ladenine , 2 -methylguanine , 3 -methylcytosine , 
5 -methylcytosine , N6 -adenine , 7 -methylguanine , 

5-methylaminomethyluracil , 5-methoxyaminomethyl-2- thiouracil , 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil , 

3 0 5-methoxyuracil , 2 -methyl thio-N6-i sopenteny ladenine , uracil- 5- 

oxyacetic acid (v) , wybutoxosine , pseudouracil , queosine, 
2- thiocytosine, 5 -methyl -2- thiouracil , 2- thiouracil , 
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4-thiouracil , 5-methyluracil , uracil-5-oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl- 
2-thiouracil , 3- ( 3 -amino-3-N-2 -carboxypropyl ) uracil, (acp3 ) w, 
and 2 , 6-diaminopurine . 
5 The antisense oligonucleotide can also comprise at least 

one modified sugar moiety selected from the group including, 
but not limited to, arabinose, 2-f luoroarabinose , xylulose, and 
hexose . 

In yet another embodiment, the antisense oligonucleotide 
10 will comprise at least one modified phosphate backbone selected 
from the group including, but not limited to, a 

phosphorothioate , a phosphorodithioate , a phosphoramidothioate , 
a phosphor ami date, a phosphordiamidate , a methylphosphonate , an 
alkyl phosphotriester , and a formacetal or analog thereof. 

15 In yet another embodiment, the antisense oligonucleotide 

is an a-anomeric oligonucleotide. An a-anomeric 
oligonucleotide forms specific double- stranded hybrids with 
complementary RNA in which, contrary to the usual (3-units, the 
strands run parallel to each other (Gautier et al . , 1987, Nucl. 

20 Acids Res. 15:6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et al . , 1987, Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al . , 
1987, FEBS Lett. 215:327-33 0). Alternatively, double stranded 
RNA can be used to disrupt the expression and function of a 

2 5 targeted NHP. 

Oligonucleotides of the invention can be synthesized by 
standard methods known in the art, e.g., by use of an automated 
DNA synthesizer (such as are commercially available from 
Biosearch, Applied Biosystems, etc.) . As examples, 

3 0 phosphorothioate oligonucleotides can be synthesized by the 

method of Stein et al. (1988, Nucl. Acids Res. 15:3209), and 
methylphosphonate oligonucleotides can be prepared by use of 
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controlled pore glass polymer supports (Sarin et al . , 1988, 
Proc. Natl. Acad. Sci. USA 85:7448-7451), etc. 

Low stringency conditions are well-known to those of skill 
in the art, and will vary predictably depending on the specific 
5 organisms from which the library and the labeled sequences are 
derived. For guidance regarding such conditions, see, for 
example, Sambrook et al . , 1989, Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (and 
periodic updates thereof), and Ausubel et al . , 1989, supra. 

10 Alternatively, suitably labeled NHP nucleotide probes can 

be used to screen a human genomic library using appropriately 
stringent conditions or by PCR. The identification and 
characterization of human genomic clones is helpful for 
identifying polymorphisms (including, but not limited to, 

15 nucleotide repeats, microsatellite alleles, single nucleotide 
polymorphisms, or coding single nucleotide polymorphisms) , 
determining the genomic structure of a given locus /allele , and 
designing diagnostic tests. For example, sequences derived 
from regions adjacent to the intron/exon boundaries of the 

2 0 human gene can be used to design primers for use in 

amplification assays to detect mutations within the exons, 
introns, splice sites (e.g., splice acceptor and/or donor 
sites), etc., that can be used in diagnostics and 
pharmacogenomics . 

2 5 For example, the present sequences can be used in 

restriction fragment length polymorphism (RFLP) analysis to 
identify specific individuals. In this technique, an 
individual's genomic DNA is digested with one or more 
restriction enzymes, and probed on a Southern blot to yield 

3 0 unique bands for identification (as generally described in U.S. 

Patent No. 5,272,057, incorporated herein by reference). In 
addition, the sequences of the present invention can be used to 



12 



WO 02/072774 PCT/US02/07438 

provide polynucleotide reagents, e.g., PCR primers, targeted to 
specific loci in the human genome, which can enhance the 
reliability of DNA-based forensic identifications by, for 
example, providing another "identification marker" (i.e., 
5 another DNA sequence that is unique to a particular 

individual). Actual base sequence information can be used for 
identification as an accurate alternative to patterns formed by 
restriction enzyme generated fragments. 

Further, a NHP gene homolog can be isolated from nucleic 

10 acid from an organism of interest by performing PCR using two 

degenerate or "wobble" oligonucleotide primer pools designed on 
the basis of amino acid sequences within the NHP products 
disclosed herein. The template for the reaction may be total 
RNA, mRNA, and/or cDNA obtained by reverse transcription of 

15 mRNA, prepared from human or non-human cell lines or tissue 
known to express, or suspected of expressing, an allele of a 
NHP gene . 

The PCR product can be subcloned and sequenced to ensure 
that the amplified sequences represent the sequence of the 

2 0 desired NHP gene. The PCR fragment can then be used to isolate 
a full length cDNA clone by a variety of methods. For example, 
the amplified fragment can be labeled and used to screen a cDNA 
library, such as a bacteriophage cDNA library. Alternatively, 
the labeled fragment can be used to isolate genomic clones via 

2 5 the screening of a genomic library. 

PCR technology can also be used to isolate full length 
cDNA sequences. For example, RNA can be isolated, following 
standard procedures, from an appropriate cellular or tissue 
source (i.e., one known to express, or suspected of expressing, 

30 a NHP gene) . A reverse transcription (RT) reaction can be 

performed on the RNA using an oligonucleotide primer specific 
for the most 5' end of the amplified fragment for the priming 
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of first strand synthesis. The resulting RNA/DNA hybrid may 
then be "tailed" using a standard terminal transferase 
reaction, the hybrid may be digested with RNase H, and second 
strand synthesis may then be primed with a complementary 
5 primer. Thus, cDNA sequences upstream of the amplified 

fragment can be isolated. For a review of cloning strategies 
that can be used, see, e.g., Sambrook et al . , 1989, supra. 

A cDNA encoding a mutant NHP sequence can be isolated, for 
example, by using PCR. In this case, the first cDNA strand may 

10 be synthesized by hybridizing an oligo-dT oligonucleotide to 
mRNA isolated from tissue known to express, or suspected of 
expressing, a NHP, in an individual putatively carrying a 
mutant NHP allele, and by extending the new strand with reverse 
transcriptase. The second strand of the cDNA is then 

15 synthesized using an oligonucleotide that hybridizes 

specifically to the 5' end of the normal sequence. Using these 
two primers, the product is then amplified via PCR, optionally 
cloned into a suitable vector, and subjected to DNA sequence 
analysis through methods well-known to those of skill in the 

2 0 art. By comparing the DNA sequence of the mutant NHP allele to 
that of a corresponding normal NHP allele, the mutation (s) 
responsible for the loss or alteration of function of the 
mutant NHP gene product can be ascertained. 

Alternatively, a genomic library can be constructed using 

2 5 DNA obtained from an individual suspected of carrying, or known 

to carry, a mutant NHP allele (e.g., a person manifesting a 
NHP-associated phenotype such as, for example, obesity, high 
blood pressure, connective tissue disorders, infertility, 
etc. ) , or a cDNA library can be constructed using RNA from a 

3 0 tissue known to express, or suspected of expressing, a mutant 

NHP allele. A normal NHP gene, or any suitable fragment 
thereof, can then be labeled and used as a probe to identify 
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the corresponding mutant NHP allele in such libraries. Clones 
containing mutant NHP sequences can then be purified and 
subjected to sequence analysis according to methods well-known 
to those skilled in the art. 
5 Additionally, an expression library can be constructed 

utilizing cDNA synthesized from, for example, RJNFA isolated from 
a tissue known to express, or suspected of expressing, a mutant 
NHP allele in an individual suspected of carrying, or known to 
carry, such a mutant allele. In this manner, gene products 

10 made by the putatively mutant tissue can be expressed and 
screened using standard antibody screening techniques in 
conjunction with antibodies raised against a normal NHP 
product, as described below (for screening techniques, see, for 
example, Harlow and Lane, eds . , 1988, "Antibodies: A Laboratory 

15 Manual", Cold Spring Harbor Press, Cold Spring Harbor, N . Y . ) . 

Additionally, screening can be accomplished by screening 
with labeled NHP fusion proteins, such as, for example, 
alkaline phosphatase-NHP or NHP-alkaline phosphatase fusion 
proteins. In cases where a NHP mutation results in an 

20 expression product with altered function (e.g., as a result of 
a missense or a frameshif t mutation) , polyclonal antibodies to 
a NHP are likely to cross-react with a corresponding mutant NHP 
expression product. Library clones detected via their reaction 
with such labeled antibodies can be purified and subjected to 

25 sequence analysis according to methods well-known in the art. 
The invention also encompasses: (a) DNA vectors that 
contain any of the foregoing NHP coding sequences and/or their 
complements (i.e., antisense) ; (b) DNA expression vectors that 
contain any of the foregoing NHP coding sequences operatively 

3 0 associated with a regulatory element that directs the 

expression of the coding sequences (for example, baculovirus as 
described in U.S. Patent No. 5,869,336 herein incorporated by 
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reference) ; (c) genetically engineered host cells that contain 
any of the foregoing NHP coding sequences operatively 
associated with a regulatory element that directs the 
expression of the coding sequences in the host cell; and (d) 
5 genetically engineered host cells that express an endogenous 
NHP sequence under the control of an exogenously introduced 
regulatory element (i.e., gene activation). As used herein, 
regulatory elements include, but are not limited to, inducible 
and non-inducible promoters, enhancers, operators, and other 

10 elements known to those skilled in the art that drive and 

regulate expression. Such regulatory elements include, but are 
not limited to, the cytomegalovirus (hCMV) immediate early 
gene, regulatable, viral elements (particularly retroviral LTR 
promoters), the early or late promoters of SV40 or adenovirus, 

15 the lac system, the trp system, the TAC system, the TRC system, 
the major operator and promoter regions of phage lambda, the 
control regions of fd coat protein, the promoter for 
3-phosphoglycerate kinase (PGK) , the promoters of acid 
phosphatase, and the promoters of the yeast a-mating factors. 

2 0 The present invention also encompasses antibodies and 

anti-idiotypic antibodies (including Fab fragments) , 
antagonists and agonists of a NHP, as well as compounds or 
nucleotide constructs that inhibit expression of a NHP sequence 
(transcription factor inhibitors, antisense and ribozyme 

2 5 molecules, or open reading frame sequence or regulatory 

sequence replacement constructs) , or promote the expression of 
a NHP (e.g., expression constructs in which NHP coding 
sequences are operatively associated with expression control 
elements such as promoters, promoter/ enhancers , etc.). 

3 0 The NHPs or NHP peptides, NHP fusion proteins, NHP 

nucleotide sequences, antibodies, antagonists and agonists can 
be useful for the detection of mutant NHPs, or inappropriately 
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expressed NHPs , for the diagnosis of disease. The NHP proteins 
or peptides, NHP fusion proteins, NHP nucleotide sequences, 
host cell expression systems, antibodies, antagonists, agonists 
and genetically engineered cells and animals can be used for 
5 screening for drugs (or high throughput screening of 

combinatorial libraries) effective in the treatment of the 
symptomatic or phenotypic manifestations of perturbing the 
normal function of a NHP in the body. The use of engineered 
host cells and/or animals may offer an advantage in that such 
10 systems allow not only for the identification of compounds that 
bind to the endogenous receptor for a NHP, but can also 
identify compounds that trigger NHP-mediated activities or 
pathways . 

Finally, the NHP products can be used as therapeutics. 

15 For example, soluble derivatives such as NHP peptides/domains 
corresponding to NHPs, NHP fusion protein products (especially 
NHP-Ig fusion proteins, i.e., fusions of a NHP, or a domain of 
a NHP, to an IgFc) , NHP antibodies and anti-idiotypic 
antibodies (including Fab fragments) , antagonists or agonists 

2 0 (including compounds that modulate or act on downstream targets 
in a NHP-mediated pathway) can be used to directly treat 
diseases or disorders., For instance, the administration of an 
effective amount of a soluble NHP , a NHP-IgFc fusion protein, 
or an anti-idiotypic antibody (or its Fab) that mimics a NHP 

2 5 could activate or effectively antagonize the endogenous NHP 

receptor. Nucleotide constructs encoding such NHP products can 
be used to genetically engineer host cells to express such 
products in vivo; these genetically engineered cells function 
as "bioreactors " in the body delivering a continuous supply of 

3 0 a NHP, a NHP peptide, or a NHP fusion protein to the body. 

Nucleotide constructs encoding functional NHPs, mutant NHPs, as 
well as antisense and ribozyme molecules can also be used in 
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"gene therapy" approaches for the modulation of NHP expression. 
Thus, the invention also encompasses pharmaceutical 
formulations and methods for treating biological disorders. 

Various aspects of the invention are described in greater 
5 detail in the subsections below. 

5 . 1 THE NHP SEQUENCES 
The cDNA sequences and the corresponding deduced amino 
acid sequences of the described NHPs are presented in the 

10 Sequence Listing. The NHP nucleotides were obtained from 
clustered human genomic sequences, testis and mammary 
transcript RACE products, ESTs , and human cDNAs made from fetal 
kidney, lymph node, trachea, lung, mammary gland, bone marrow, 
and pericardium (SEQ ID NOS:l-3), and kidney, fetal kidney, 

15 mammary gland, and prostate (SEQ ID NOS:4-6) mRNAs (Edge 

Biosystems, Gaithersburg, MD, and Clontech, Palo Alto, CA) . 

One polymorphisms was found, a C/T polymorphism at the 
nucleotide position corresponding to nucleotide 42 6 of, for 
example, SEQ ID NO : 4 , both of which result in an asn being 

20 present at corresponding amino acid position 142 of SEQ ID 
NO: 5. SEQ ID NOS:l-3 are apparently encoded on human 
chromosome 4 (see GENBANK accession no. AC079237), and SEQ ID 
NOS:4-6 are apparently encoded on human chromosome 8 (see 
GENBANK accession no. AC024371). Accordingly, the described 

2 5 sequences are useful for mapping and/or defining the 

corresponding coding regions of the human genome. 

An additional application of the described novel human 
polynucleotide sequences is their use in the molecular 
mutagenesis/evolution of proteins that are at least partially 

3 0 encoded by the described novel sequences using, for example, 

polynucleotide shuffling or related methodologies. Such 
approaches are described in U.S. Patent Nos . 5,830,721 and 
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5,837,458, which are herein incorporated by reference in their 
entirety. 

NHP gene products can also be expressed in transgenic 
animals. Animals of any species, including, but not limited 
5 to, worms, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 
birds, goats, and non-human primates, e.g., baboons, monkeys, 
and chimpanzees, may be used to generate NHP transgenic 
animals . 

Any technique known in the art may be used to introduce a 

10 NHP transgene into animals to produce the founder lines of 
transgenic animals. Such techniques include, but are not 
limited to, pronuclear microinjection (Hoppe and Wagner, 1989, 
U.S. Patent No. 4,873,191); retrovirus-mediated gene transfer 
into germ lines (Van der Putten et al . , 1985, Proc . Natl. Acad. 

15 Sci. USA 82:6148-6152); gene targeting in embryonic stem cells 
(Thompson et al . , 1989, Cell 56:313-321); elec troporation of 
embryos (Lo, 1983, Mol Cell. Biol. 3:1803-1814); and sperm- 
mediated gene transfer (Lavitrano et al . , 1989, Cell 57:717- 
723); etc. For a review of such techniques, see Gordon, 1989, 

20 Transgenic Animals, Intl. Rev. Cytol . 115:171-229, which is 
incorporated by reference herein in its entirety. 

The present invention provides for transgenic animals that 
carry a NHP transgene in all their cells, as well as animals 
that carry a transgene in some, but not all their cells, i.e., 

25 mosaic animals or somatic cell transgenic animals. A transgene 
may be integrated as a single transgene, or in concatamers, 
e.gr. , head-to-head tandems or head-to-tail tandems. A 
transgene may also be selectively introduced into and activated 
in a particular cell-type by following, for example, the 

30 teaching of Lasko et al . , 1992, Proc. Natl. Acad. Sci. USA 
82:6232-6236. The regulatory sequences required for such a 
cell-type specific activation will depend upon the particular 
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cell-type of interest, and will be apparent to those of skill 
in the art. 

When it is desired that a NHP transgene be integrated into 
the chromosomal site of the endogenous NHP gene, gene targeting 
5 is preferred. Briefly, when such a technique is to be 
utilized, vectors containing some nucleotide sequences 
homologous to the endogenous NHP gene are designed for the 
purpose of integrating, via homologous recombination with 
chromosomal sequences, into and disrupting the function of the 

10 nucleotide sequence of the endogenous NHP gene (i.e., 
"knockout" animals) . 

The transgene can also be selectively introduced into a 
particular cell- type, thus inactivating the endogenous NHP gene 
in only that cell- type, by following, for example, the teaching 

15 of Gu et al. t 1994, Science 265:103-106. The regulatory 

sequences required for such a cell-type specific inactivation 
will depend upon the particular cell- type of interest, and will 
be apparent to those of skill in the art. 

Once transgenic animals have been generated, the 

20 expression of the recombinant NHP gene may be assayed utilizing 
standard techniques. Initial screening may be accomplished by 
Southern blot analysis or PCR techniques to analyze animal 
tissues to assay whether integration of the transgene has taken 
place. The level of mRNA expression of the transgene in the 

2 5 tissues of the transgenic animals may also be assessed using 

techniques that include, but are not limited to, Northern blot 
analysis of tissue samples obtained from the animal, in situ 
hybridization analysis, and RT-PCR. Samples of NHP gene- 
expressing tissue may also be evaluated immunocytochemically 

30 using antibodies specific for the NHP transgene product. 

The present invention also provides for "knock-in" 
animals. Knock-in animals are those in which a polynucleotide 
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sequence (i.e., a gene or a cDNA) that the animal does not 
naturally have in its genome is inserted in such a way that it 
is expressed. Examples include, but are not limited to, a 
human gene or cDNA used to replace its murine ortholog in the 
5 mouse, a murine cDNA used to replace the murine gene in the 
mouse, and a human gene or cDNA or murine cDNA that is tagged 
with a reporter construct used to replace the murine .ortholog 
or gene in the mouse. Such replacements can occur at the locus 
of the murine ortholog or gene, or at another specific site. 
10 Such knock-in animals are useful for the in vivo study, testing 
and validation of, intra alia, human drug targets, as well as 
for compounds that are directed at the same and therapeutic 
proteins . 

15 5.2 NHPS AND NHP POLYPEPTIDES 

NHPs, NHP polypeptides, NHP peptide fragments, mutated, 
truncated, or deleted forms of the NHPs, and/or NHP fusion 
proteins can be prepared for a variety of uses. These uses 
include, but are not limited to, the generation of antibodies, 

20 as reagents in diagnostic assays, for the identification of 

other cellular gene products related to a NHP, and as reagents 
in assays for screening for compounds that can be used as 
pharmaceutical reagents useful in the therapeutic treatment of 
mental, biological, or medical disorders and diseases. Given 

2 5 the similarity information and expression data, the described 
NHPs can be targeted (by drugs, oligos, antibodies, etc.) in 
order to treat disease, or to therapeutically augment the 
efficacy of, for example, chemo therapeutic agents used in the 
treatment of breast or prostate cancer. 

30 The Sequence Listing discloses the amino acid sequences 

encoded by the described NHP polynucleotides. The NHPs 
typically display initiator methionines in DNA sequence 
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contexts consistent with a translation initiation site. SEQ ID 
NO: 5 displays a signal type sequence similar to those often 
found on membrane proteins; however, all of the described 
proteins display multiple transmembrane hydrophobic domains 
5 typical of membrane associated proteins. 

The NHP amino acid sequences of the invention include the 
amino acid sequence presented in the Sequence Listing, as well 
as analogues and derivatives thereof. Further, corresponding 
NHP homologues from other species are encompassed by the 

10 invention. In fact, any NHP protein encoded by the NHP 

nucleotide sequences described herein are within the scope of 
the invention, as are any novel polynucleotide sequences 
encoding all or any novel portion of an amino acid sequence 
presented in the Sequence Listing. The degenerate nature of 

15 the genetic code is well-known, and, accordingly, each amino 
acid presented in the Sequence Listing is generically 
representative of the well-known nucleic acid "triplet" codon, 
or in many cases codons, that can encode the amino acid. As 
such, as contemplated herein, the amino acid sequences 

2 0 presented in the Sequence Listing, when taken together with the 
genetic code (see, for example, Table 4-1 at page 109 of 
"Molecular Cell Biology", 1986, J. Darnell et al . , eds . , 
Scientific American Books, New York, N.Y., herein incorporated 
by reference) , are generically representative of all the 

2 5 various permutations and combinations of nucleic acid sequences 

that can encode such amino acid sequences. 

The invention also encompasses proteins that are 
functionally equivalent to the NHPs encoded by the presently 
described nucleotide sequences, as judged by any of a number of 

3 0 criteria, including, but not limited to, the ability to bind 

and cleave a substrate of a NHP, the ability to effect an 
identical or complementary downstream pathway, or a change in 
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cellular metabolism (e.g., proteolytic activity, ion flux, 
tyrosine phosphorylation, transport, etc.). Such functionally 
equivalent NHP proteins include, but are not limited to, 
additions or substitutions of amino acid residues within the 
5 amino acid sequence encoded by the NHP nucleotide sequences 
described herein, but that result in a silent change, thus 
producing a functionally equivalent expression product. Amino 
acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobici ty , hydrophilicity , 

10 and/or the amphipathic nature of the residues involved. For 
example, nonpolar (hydrophobic) amino acids include alanine, 
leucine, isoleucine, valine, proline, phenylalanine, 
tryptophan, and methionine; polar neutral amino acids include 
glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

15 glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged 
(acidic) amino acids include aspartic acid and glutamic acid. 

A variety of host-expression vector systems can be used to 
express the NHP nucleotide sequences of the invention. Where, 

2 0 as in the present instance, the NHP peptides or polypeptides 

are thought to be from a membrane protein, the hydrophobic 
regions of the protein can be excised and the resulting soluble 
peptides or polypeptides can be recovered from the culture 
media. Such expression systems also encompass engineered host 
25 cells that express a NHP, or a functional equivalent, in situ. 
Purification or enrichment of a NHP from such expression 
systems can be accomplished using appropriate detergents and 
lipid micelles and methods well-known to those skilled in the 
art. However, such engineered host cells themselves may be 

3 0 used in situations where it is important not only to retain the 

structural and functional characteristics of a NHP, but to 
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assess biological activity, e.g., in certain drug screening 
assays . 

The expression systems that may be used for purposes of 
the invention include, but are not limited to, microorganisms 
5 such as bacteria {e.g., E. coll, B. subtil Is) transformed with 
recombinant bacteriophage DNA, plasmid DNA or cosmid DNA 
expression vectors containing NHP nucleotide sequences; yeast 
(e.g. , Saccharomyces , Pichia) transformed with recombinant 
yeast expression vectors containing NHP nucleotide sequences; 

10 insect cell systems infected with recombinant virus expression 
vectors (e.g., baculovirus) containing NHP nucleotide 
sequences; plant cell systems infected with recombinant virus 
expression vectors (e.g., cauliflower mosaic virus, CaMV; 
tobacco mosaic virus, TMV) or transformed with recombinant 

15 plasmid expression vectors (e.g., Ti plasmid) containing NHP 
nucleotide sequences; or mammalian cell systems (e.g., COS, 
CHO, BHK, 293, 3T3 ) harboring recombinant expression constructs 
containing NHP nucleotide sequences and promoters derived from 
the genome of mammalian cells (e.g., metallothionein promoter) 

20 or from mammalian viruses (e.g., the adenovirus late promoter; 
the vaccinia virus 7 . 5K promoter) . 

In bacterial systems, a number of expression vectors may 
be advantageously selected depending upon the use intended for 
the NHP product being expressed. For example, when a large 

25 quantity of such a protein is to be produced for the generation 
of pharmaceutical compositions of or containing a NHP, or for 
raising antibodies to a NHP, vectors that direct the expression 
of high levels of fusion protein products that are readily 
purified may be desirable. Such vectors include, but are not 

30 limited to, the E. coli expression vector pUR278 (Ruther et 

al., 1983, EMBO J. 2:1791), in which a NHP coding sequence may 
be ligated individually into the vector in-frame with the lacZ 
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coding region so that a fusion protein is produced; pIN vectors 
(Inouye and Inouye, 1985, Nucleic Acids Res. 13:3101-3109; Van 
Heeke and Schuster, 1989, J. Biol. Chem. 254:5503-5509); and 
the like. pGEX vectors (Pharmacia or American Type Culture 
5 Collection) can also be used to express foreign polypeptides as 
fusion proteins with glutathione S- transferase (GST) . In 
general, such fusion proteins are soluble and can easily be 
purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. 

10 The pGEX vectors are designed to include thrombin or factor Xa 
protease cleavage sites so that the cloned target expression 
product can be released from the GST moiety. 

In an exemplary insect system, Autographs calif ornica 
nuclear polyhedrosis virus (AcNPV) is used as a vector to 

15 express foreign polynucleotide sequences. The virus grows in 
Spodoptera frugiperda cells. A NHP coding sequence can be 
cloned individually into a non-essential region (for example 
the polyhedrin gene), of the virus and placed under control of 
an AcNPV promoter (for example the polyhedrin promoter) . 

20 Successful insertion of a NHP coding sequence will result in 
inactivation of the polyhedrin gene and production of non- 
occluded recombinant virus (i.e., virus lacking the 
proteinaceous coat coded for by the polyhedrin gene) . These 
recombinant viruses are then used to infect Spodoptera 

2 5 frugiperda cells in which the inserted sequence is expressed 

(e.g., see Smith et al . , 1983, J. Virol. 46:584; Smith, U.S. 
Patent No. 4,215,051). 

In mammalian host cells, a number of viral-based 
expression systems may be utilized. In cases where an 

3 0 adenovirus is used as an expression vector, the NHP nucleotide 

sequence of interest may be ligated to an adenovirus 
transcription/ translation control complex, e.g., the late 
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promoter and tripartite leader sequence. This chimeric 
sequence may then be inserted in the adenovirus genome by in 
vitro or in vivo recombination. Insertion in a non-essential 
region of the viral genome (e.g., region El or E3 ) will result 
5 in a recombinant virus that is viable and capable of expressing 
a NHP product in infected hosts (e.g., see Logan and Shenk, 
1984, Proc. Natl. Acad. Sci . USA 51:3 655-3659). Specific 
initiation signals may also be required for efficient 
translation of inserted NHP nucleotide sequences. These 

10 signals include the ATG initiation codon and adjacent 
sequences. In cases where an entire NHP gene or cDNA, 
including its own initiation codon and adjacent sequences, is 
inserted into the appropriate expression vector, no additional 
translational control signals may be needed. However, in cases 

15 where only a portion of a NHP coding sequence is inserted, 

exogenous translational control signals, including, perhaps, 
the ATG initiation codon, may be provided. Furthermore, the 
initiation codon should be in phase with the reading frame of 
the desired coding sequence to ensure translation of the entire 

2 0 insert. These exogenous translational control signals and 

initiation codons can be of a variety of origins, both natural 
and synthetic. The efficiency of expression may be enhanced by 
the inclusion of appropriate transcription enhancer elements, 
transcription terminators, etc. (see Bitter et al . , 1987, 

25 Methods in Enzymol . 153:516-544). 

In addition, a host cell strain may be chosen that 
modulates the expression of the inserted sequences, or modifies 
and processes the expression product in the specific fashion 
desired. Such modifications (e.g., glycosylation) and 

30 processing (e.gr. , cleavage) of protein products may be 

important for the function of the protein. Different host 
cells have characteristic and specific mechanisms for the post- 
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translational processing and modification of proteins and 
expression products. Appropriate cell lines or host systems 
can be chosen to ensure the desired modification and processing 
of the foreign protein expressed. To this end, eukaryotic host 
5 cells that possess the cellular machinery for the desired 
processing of the primary transcript, glycosylation, and 
phosphorylation of the expression product may be used. Such 
mammalian host cells include, but are not limited to, CHO, 
VERO, BHK, HeLa, COS, MDCK, 293, 3T3 , WI38, and in particular, 

10 human cell lines. 

For long-term, high-yield production of recombinant 
proteins, stable expression is preferred. For example, cell 
lines that stably express the NHP sequences described herein 
can be engineered. Rather than using expression vectors that 

15 contain viral origins of replication, host cells can be 

transformed with DNA controlled by appropriate expression 
control elements (e.g., promoter, enhancer sequences, 
transcription terminators, polyadenylation sites, etc.), and a 
selectable marker. Following the introduction of the foreign 

2 0 DNA, engineered cells may be allowed to grow for 1-2 days in an 
enriched media, and then switched to a selective media. The 
selectable marker in the recombinant plasmid confers resistance 
to the selection and allows cells to stably integrate the 
plasmid into their chromosomes and grow to form foci, which in 

25 turn can be cloned and expanded into cell lines. This method 
may advantageously be used to engineer cell lines that express 
the NHP product. Such engineered cell lines may be 
particularly useful in screening and evaluation of compounds 
that affect the endogenous activity of the NHP product. 

30 A number of selection systems may be used, including, but 

not limited to, the herpes simplex virus thymidine kinase 
(Wigler et al . , 1977, Cell 11:223), hypoxan thine- guanine 
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phosphoribosyltransf erase (Szybalska and Szybalski, 19 62, Proc . 
Natl. Acad. Sci . USA 48:2026), and adenine 

phosphoribosyltransf erase (Lowy et al . , 1980, Cell 22:817) 
genes, which can be employed in tk", hgprt" or aprt" cells, 
5 respectively. Also, antimetabolite resistance can be used as 
the basis of selection for the following genes: dhfr, which 
confers resistance to methotrexate (Wigler et al . , 1980, Proc. 
Natl. Acad. Sci. USA 77:3567; O ' Hare et al . , 1981, Proc. Natl. 
Acad. Sci.. USA 78:1527); gpt, which confers resistance to 

10 mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl. Acad. 
Sci. USA 78:2072); neo, which confers resistance to' the 
aminoglycoside G-418 (Colberre-Garapin et al . , 1981, J. Mol . 
Biol. 150:1); and hygro , which confers resistance to hygromycin 
(Santerre et al . , 1984, Gene 30:147). 

15 Alternatively, any fusion protein can be readily purified 

by utilizing an antibody specific for the fusion protein being 
expressed. Another exemplary system allows for the ready 
purification of non-denatured fusion proteins expressed in 
human cell lines (Janknecht et al . , 1991, Proc. Natl. Acad. 

20 Sci. USA 88:8972-8976). In this system, the sequence of 

interest is subcloned into a vaccinia recombination plasmid 
such that the sequence's open reading frame is translationally 
fused to an amino- terminal tag consisting of six histidine 
residues. Extracts from cells infected with recombinant 

25 vaccinia virus are loaded onto Ni 2+ -nitriloacetic acid-agarose 
columns, and histidine- tagged proteins are selectively eluted 
with imidazole-containing buffers. 

Also encompassed by the present invention are fusion 
proteins that direct a NHP to a target organ and/or facilitate 

3 0 transport across the membrane into the cytosol. Conjugation of 
NHPs to antibody molecules or their Fab fragments could be used 
to target cells bearing a particular epitope. Attaching an 
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appropriate signal sequence to a NHP would also transport a NHP 
to a desired location within the cell. Alternatively targeting 
of a NHP or its nucleic acid sequence might be achieved using 
liposome or lipid complex based delivery systems. Such 
5 technologies are described in "Liposomes: A Practical 

Approach", New, R.R.C., ed. , Oxford University Press, N.Y. , and 
in U.S. Patent Nos . 4,594,595, 5,459,127, 5,948,767 and 
6,110,490 and their respective disclosures, which are herein 
incorporated by reference in their entirety. Additionally 

10 embodied are novel protein constructs engineered in such a way 
that they facilitate transport of NHPs to a target site or 
desired organ, where they cross the cell membrane and/or the 
nucleus where the NHPs can exert their functional activity. 
This goal may be achieved by coupling of a NHP to a cytokine or 

15 other ligand that provides targeting specificity, and/or to a 
protein transducing domain (see generally U.S. Provisional 
Patent Application Ser. Nos. 60/111,701 and 60/056,713, both of 
which are herein incorporated by reference, for examples of 
such transducing sequences) , to facilitate passage across 

2 0 cellular membranes, and can optionally be engineered to include 
nuclear localization signals. 

5 . 3 ANTIBODIES TO NHP PRODUCTS 
Antibodies that specifically recognize one or more 

25 epitopes of a NHP, epitopes of conserved variants of a NHP, or 
peptide fragments of a NHP, are also encompassed by the 
invention. Such antibodies include, but are not limited to, 
polyclonal antibodies, monoclonal antibodies (mAbs) , humanized 
or chimeric antibodies, single chain antibodies, Fab fragments, 

30 F(ab') 2 fragments, fragments produced by a Fab expression 
library, anti-idiotypic (anti-Id) antibodies, and epitope- 
binding fragments of any of the above. 
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The antibodies of the invention may be used, for example, 
in the detection of a NHP in a biological sample and may, 
therefore, be utilized as part of a diagnostic or prognostic 
technique whereby patients may be tested for abnormal amounts 
5 of a NHP. Such antibodies may also be utilized in conjunction 
with, for example, compound screening schemes for the 
evaluation of the effect of test compounds on expression and/or 
activity of a NHP expression product. Additionally, such 
antibodies can be used in conjunction with gene therapy to, for 

10 example, evaluate normal and/or engineered NHP- expressing cells 
prior to their introduction into a patient. Such antibodies 
may additionally be used in methods for the inhibition of 
abnormal NHP activity. Thus, such antibodies may be utilized 
as a part of treatment methods . 

15 For the production of antibodies, various host animals may 

be immunized by injection with a NHP, a NHP peptide (e.g., one 
corresponding to a functional domain of a NHP) , truncated NHP 
polypeptides (NHP in which one or more domains have been 
deleted) , functional equivalents of the NHP, or mutated 

2 0 variants of the NHP. Such host animals may include, but are 
not limited to, pigs, rabbits, mice, goats, and rats, to name 
but a few. Various adjuvants may be used to increase the 
immunological response, depending on the host species, 
including, but not limited to, Freund's adjuvant (complete and 

2 5 incomplete) , mineral salts such as aluminum hydroxide or 

aluminum phosphate, chitosan, surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, and potentially useful human adjuvants such as BCG 
(bacille Calmette-Guerin) and Corynebacterium parvum. 

3 0 Alternatively, the immune response could be enhanced by 

combination and/or coupling with molecules such as keyhole 
limpet hemocyanin, tetanus toxoid, diphtheria toxoid, 
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ovalbumin, cholera toxin, or fragments thereof. Polyclonal 
antibodies are heterogeneous populations of antibody molecules 
derived from the sera of the immunized animals. 

Monoclonal antibodies, which are homogeneous populations 
5 of antibodies to a particular antigen, can be obtained by any 
technique that provides for the production of antibody 
molecules by continuous cell lines in culture. These include, 
but are not limited to, the hybridoma technique of Kohler and 
Milstein, (1975, Nature 255:495-497; and U.S. Patent No. 

10 4,376,110), the human B-cell hybridoma technique (Kosbor et 
al., 1983, Immunology Today 4:72; Cole et al . , 1983, Proc . 
Natl. Acad. Sci. USA 8(9:2026-2030), and the EBV-hybridoma 
technique (Cole et al . , 1985, Monoclonal Antibodies And Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may 

15 be of any immunoglobulin class, including IgG, IgM, IgE, IgA, 
and IgD, and any subclass thereof. The hybridomas producing 
the mAbs of this invention may be cultivated in vitro or in 
vivo. Production of high titers of mAbs in vivo makes this the 
presently preferred method of production. 

2 0 In addition, techniques developed for the production of 

"chimeric antibodies" (Morrison et al . , 1984, Proc. Natl. Acad. 
Sci. USA 51:6851-6855; Neuberger et al . , 1984, Nature, 322:604- 
608; Takeda et al . , 1985, Nature, 314:452-454), by splicing the 
genes. from a mouse antibody molecule of appropriate antigen 

2 5 specificity together with genes from a human antibody molecule 

of appropriate biological activity, can be used. A chimeric 
antibody is a molecule in which different portions are derived 
from different animal species, such as those having a variable 
region derived from a murine mAb and a human immunoglobulin 

3 0 constant region. Such technologies are described in U.S. 

Patent Nos . 6,114,598, 6,075,181 and 5,877,397 and their 
respective disclosures, which are herein incorporated by 
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reference in their entirety. Also encompassed by the present 
invention is the use of fully humanized monoclonal antibodies, 
as described in U.S. Patent No. 6,150,584 and respective 
disclosures, which are herein incorporated by reference in 
5 their entirety. 

Alternatively, techniques described for the production of 
single chain antibodies (U.S. Patent No. 4,946,778; Bird, 1988, 
Science 242 : 42 3 - 42 6 Huston et al . , 1988, Proc . Natl. Acad. 
Sci. USA 85:5879-5883; and Ward et al . , 1989, Nature 341:544- 
10 546) can be adapted to produce single chain antibodies against 
NHP expression products. Single chain antibodies are formed by 
linking the heavy and light chain fragments of the Fv region 
via an amino acid bridge, resulting in a single chain 
polypeptide . 

15 Antibody fragments that recognize specific epitopes may be 

generated by known techniques. For example, such fragments 
include, but are not limited to: F(ab') 2 fragments, which can be 
produced by pepsin digestion of an antibody molecule; and Fab 
fragments, which can be generated by reducing the disulfide 

20 bridges of F(ab , ) 2 fragments. Alternatively, Fab expression 
libraries may be constructed (Huse et al . , 1989, Science, 
245:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Antibodies to a NHP can, in turn, be utilized to generate 

25 anti-idiotype antibodies that "mimic" a given NHP, using 

techniques well-known to those skilled in the art (see, e.g., 
Greenspan and Bona, 1993, FASEB J. 7:437-444; and Nissinoff, 
1991, J. Immunol. 147:2429-2438). For example, antibodies that 
bind to a NHP domain and competitively inhibit the binding of 

3 0 the NHP to its cognate receptor can be used to generate anti- 
idiotypes that "mimic" the NHP and, therefore, bind and 
activate or neutralize a receptor. Such anti-idiotypic 
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antibodies, or Fab fragments of such anti-idiotypes , can be 
used in therapeutic regimens involving a NHP-mediated pathway. 

Additionally, given the high degree of relatedness of 
mammalian NHPs, NHP knock-out mice (having never seen a NHP, 
5 and thus never been tolerized to a NHP } have a unique utility, 
as they can be advantageously applied to the generation of 
antibodies against the disclosed mammalian NHPs (i.e., a NHP 
will be immunogenic in NHP knock-out animals) . 

The present invention is not to be limited in scope by the 

10 specific embodiments described herein, which are intended as 
single illustrations of individual aspects of the invention, 
and functionally equivalent methods and components are within 
the scope of the invention. Indeed, various modifications of 
the invention, in addition to those shown and described herein, 

15 will become apparent to those skilled in the art from the 

foregoing description. Such modifications are intended to fall 
within the scope of the appended claims. All cited 
publications, patents, and patent applications are herein 
incorporated by reference in their entirety. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid molecule comprising at 
least 59 contiguous nucleotides from SEQ ID NO : 1 . 

5 

2. An isolated nucleic acid molecule comprising a 
nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 
NO : 2 ; and 

10 (b) hybridizes under stringent conditions to the 

nucleotide sequence of SEQ ID NO : 1 or the 
complement thereof . 

3 . An isolated nucleic acid molecule comprising a 
15 nucleotide sequence that encodes the amino acid sequence shown 

in SEQ ID NO : 2 . 

4. A substantially isolated protein having the 
transporter activity of the protein shown in SEQ ID NO : 2 , which 

2 0 is encoded by a nucleotide sequence that hybridizes to SEQ ID 

NO : 1 under highly stringent conditions. 

5. A substantially isolated protein according to 
Claim 4 comprising the amino acid sequence of SEQ ID NO : 2 . 

25 

6. An isolated nucleic acid molecule comprising at 
least 47 contiguous nucleotides from SEQ ID NO : 4 . 

7. An isolated nucleic acid molecule comprising a 

3 0 nucleotide sequence that: 

(a) encodes the amino acid sequence shown in SEQ ID 
NO : 5 ; and 
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(b) hybridizes under stringent conditions to the 
nucleotide sequence of SEQ ID NO : 4 or the 
complement thereof . 

5 8. An isolated nucleic acid molecule comprising a 

nucleotide sequence that encodes the amino acid sequence shown 
in SEQ ID NO : 4 . 

9. A substantially isolated protein having the 
10 transporter activity of the protein shown in SEQ ID NO : 5 , which 
is encoded by a nucleotide sequence that hybridizes to SEQ ID 
NO: 4 under highly stringent conditions. 



10. A substantially isolated protein according to 
IB Claim 9 comprising the amino acid sequence of SEQ ID NO : 5 . 
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SEQUENCE LISTING 
<110> LEXICON GENETICS INCORPORATED 

<12 0> Novel Human Transporter Proteins and Polynucleotides Encoding the 
Same 

<130> LEX-0314-PCT 

<150> US 60/275, 009 
<151> 2001-03-12 

<150> US 60/284, 152 
<151> 2001-04-17 

<160> 6 

<170> FastSEQ for Windows Version 4.0 



<210> 1 

<211> 1134 

<212> DNA 

<213> Homo sapiens 



<400> 1 

atgagagcca attgttccag cagctcagcc tgccctgcca acagttcaga ggaggagctg 60 

ccagtgggac tggaggtgca tggaaacctg gagctcgttt tcacagtggt gtccactgtg 120 

atgatggggc tgctcatgtt ctctttggga tgttccgtgg agatccggaa gctgtggtcg 180 

cacatcagga gaccctgggg cattgctgtg ggactgctct gccagtttgg gctcatgcct 240 

tttacagctt atctcctggc cattagcttt tctctgaagc cagtccaagc tattgctgtt 300 

ctcatcatgg gctgctgccc ggggggcacc atctctaaca ttttcacctt ctgggttgat 360 

ggagatatgg atctcagcat cagtatgaca acctgttcca ccgtggccgc cctgggaatg 420 

atgccactct gcatttatct ctacacctgg tcctggagtc ttcagcagaa tctcaccatt 480 

ccttatcaga acataggaat tacccttgtg tgcctgacca ttcctgtggc ctttggtgtc 540 

tatgtgaatt acagatggcc aaaacaatcc aaaatcattc tcaagattgg ggccgttgtt 600 

ggtggggtcc tccttctggt ggtcgcagtt gctggtgtgg tcctggcgaa aggatcttgg 660 

aattcagaca tcacccttct gaccatcagt ttcatctttc ctttgattgg ccatgtcacg 720 

ggttttctgc tggcactttt tacccaccag tcttggcaaa ggtgcaggac aatttcctta 780 

gaaactggag ctcagaatat tcagatgtgc atcaccatgc tccagttatc tttcactgct 840 

gagcacttgg tccagatgtt gagtttccca ctggcctatg gactcttcca gctgatagat 900 

ggatttctta ttgttgcagc atatcagacg tacaagagga gattgaagaa caaacatgga 9 60 

aaaaagaact caggttgcac agaagtctgc catacgagga aatcgacttc ttccagagag 1020 

accaatgcct tcttggaggt gaatgaagaa ggtgccatca ctcctgggcc accagggcca 1080 

atggattgcc acagggctct cgagccagtt ggccacatca cttcatgtga atag 1134 



<210> 2 

<211> 377 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Arg Ala Asn Cys Ser Ser Ser Ser Ala Cys Pro Ala Asn Ser Ser 

15 10 15 

Glu Glu Glu Leu Pro Val Gly Leu Glu Val His Gly Asn Leu Glu Leu 
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20 25 30 

Val^Phe Thr Val Val Ser Thr Val Met Met Gly Leu Leu Met Phe Ser 

3*5 40 45 

Leu Gly Cys Ser Val Glu lie Arg Lys Leu Trp Ser His lie Arg Arg 

50 55 60 - 

Pro Trp Gly lie Ala Val Gly Leu Leu Cys Gin Phe Gly Leu Met Pro 
65 70 75 80 

Phe Thr Ala Tyr Leu Leu Ala He Ser Phe Ser Leu Lys Pro Val Gin 

85 90 95 

Ala He Ala Val Leu He Met Gly Cys Cys Pro Gly Gly Thr lie Ser 

100 105 110 

Asn He Phe Thr Phe Trp Val Asp Gly Asp Met Asp Leu Ser He Ser 

115 120 125 

Met Thr Thr Cys Ser Thr Val Ala Ala Leu Gly Met Met Pro Leu Cys 

130 135 140 

He Tyr Leu Tyr Thr Trp Ser Trp Ser Leu Gin Gin Asn Leu Thr He 
145 150 155 160 

Pro Tyr Gin Asn He Gly He Thr Leu Val Cys Leu Thr He Pro Val 

165 170 175 

Ala Phe Gly Val Tyr Val Asn Tyr Arg Trp Pro Lys Gin Ser Lys He 

180 185 190 

He Leu Lys He Gly Ala Val Val Gly Gly Val Leu Leu Leu Val Val 

195 200 205 

Ala Val Ala Gly Val Val Leu Ala Lys Gly Ser Trp Asn Ser Asp He 

210 215 220 

Thr Leu Leu Thr He Ser Phe He Phe Pro Leu He Gly His Val Thr 
225 230 235 240 

Gly Phe Leu Leu Ala Leu Phe Thr His Gin Ser Trp Gin Arg Cys Arg 

245 250 255 

Thr He Ser Leu Glu Thr Gly Ala Gin Asn He Gin Met Cys He Thr 

260 265 270 

Met Leu Gin Leu Ser Phe Thr Ala Glu His Leu Val Gin Met Leu Ser 

275 280 285 

Phe Pro Leu Ala Tyr Gly Leu Phe Gin Leu He Asp Gly Phe Leu He 

290 295 300 

Val Ala Ala Tyr Gin Thr Tyr Lys Arg Arg Leu Lys Asn Lys His Gly 
305 310 315 320 

Lys Lys Asn Ser Gly Cys Thr Glu Val Cys His Thr Arg Lys Ser Thr 

325 330 335 

Ser Ser Arg Glu Thr Asn Ala Phe Leu Glu Val Asn Glu Glu Gly Ala 

340 345 350 

He Thr Pro Gly Pro Pro Gly Pro Met Asp Cys His Arg Ala Leu Glu 

355 360 365 

Pro Val Gly His He Thr Ser Cys Glu 
370 375 

<210> 3 

<211> 1600 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gtgataatta cttttataat gccacttgtg aaaaaattga tcagattagg atgaatcacc 60 

ttgctggcca acagttattg gaatgattct ccatgtgtga cttcgttgca ctattacaaa 120 

atgtggcagg atagacctgc ccagccattg ttgccgatgt tcatttgtaa tgctgcctta 180 

aggagatgag gagatgagag ccaattgttc cagcagctca gcctgccctg ccaacagttc 240 
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agaggaggag ctgccagtgg gactggaggt gcatggaaac ctggagctcg ttttcacagt 3 00 

ggtgtccact gtgatgatgg ggctgctcat gttctctttg ggatgttccg tggagatccg 360 

gaagctgtgg tcgcacatca ggagaccctg gggcattgct gtgggactgc tctgccagtt 420 

tgggctcatg ccttttacag cttatctcct ggccattagc ttttctctga agccagtcca 480 

agctattgct gttctcatca tgggctgctg cccggggggc accatctcta acattttcac 540 

cttctgggtt gatggagata tggatctcag catcagtatg acaacctgtt ccaccgtggc 600 

cgccctggga atgatgccac tctgcattta tctctacacc tggtcctgga gtcttcagca 660 

gaatctcacc attccttatc agaacatagg aattaccctt gtgtgcctga ccattcctgt 720 

ggcctttggt gtctatgtga attacagatg gccaaaacaa tccaaaatca ttctcaagat 780 

tggggccgtt gttggtgggg tcctccttct ggtggtcgca gttgctggtg tggtcctggc 840 

gaaaggatct tggaattcag acatcaccct tctgaccatc agtttcatct ttcctttgat 900 

tggccatgtc acgggttttc tgctggcact ttttacccac cagtcttggc aaaggtgcag 9 60 

gacaatttcc ttagaaactg gagctcagaa tattcagatg tgcatcacca tgctccagtt 1020 

atctttcact gctgagcact tggtccagat gttgagtttc ccactggcct atggactctt 1080 

ccagctgata gatggatttc ttattgttgc agcatatcag acgtacaaga ggagattgaa 1140 

gaacaaacat ggaaaaaaga actcaggttg cacagaagtc tgccatacga ggaaatcgac 120 0 

ttcttccaga gagaccaatg ccttcttgga ggtgaatgaa gaaggtgcca tcactcctgg 1260 

gccaccaggg ccaatggatt gccacagggc tctcgagcca gttggccaca tcacttcatg 1320 

tgaatagcag ggactagctg gctggactgg cccccttctt tttcagtggc cagtaaagac 1380 

agtgtgcagc tgacacatga atcttgttgg tagggccagt gtgaatattt aagtgttcaa 1440 

tgttagaata tttatatttt catgtggatt gtgaattgtg atgggatcac ttttggagat 1500 

tcccatttca gggagtttct tctgggggtt aacataacgt atcaatgagc tgccttgtat 1560 

gttcttgtct tcccatgcag caaatagtac cctggcccta 1600 



<210> 4 
<211> 1317 
<212> DNA 

<213> Homo sapiens 



<400> 4 

atgattagaa aactttttat tgttctactt ttgttgcttg tgactataga agaagcaagg 60 

atgtcatcgc tcagttttct gaatatagag aagactgaaa tactattttt cacaaagact 120 

gaagaaacca tccttgtaag ttcaagctac gaaaataaac ggcctaattc cagccacctc 180 

tttgtgaaaa tagaagatcc taaaatacta caaatggtga atgtggccaa gaagatctca 240 

tcagatgcta caaactttac cataaatctg gtgactgatg aagaaggaga aacaaatgtg 300 

actattcaac tctgggattc tgaaggtagg caagaaagac tcattgaaga aatcaagaat 3 60 

gtgaaagtca aagtgctcaa acaaaaagac agtctactcc aggcaccaat gcatattgat 420 

agaaayatcc taatgcttat tttaccacta atactattga ataagtgtgc atttggttgt 480 

aagattgaat tacagctgtt tcaaacagta tggaagagac ctttgccagt aattcttggg 540 

gcagttacac agttttttct gatgccattt tgcgggtttc ttttgtctca gattgtggca 600 

ttgcctgagg cgcaagcttt tggagttgta atgacctgca cgtgcccagg agggggtggg 660 

ggctatctct ttgctctgct tctagatgga gatttcacat tggccatttt gatgacttgc 720 

acatcaacat tattggctct gatcatgatg cctgtcaatt cttatatata cagtaggata 780 

ttagggttgt caggtacatt ccatattcct gtttctaaaa ttgtgtcaac actccttttc 840 

atacttgtgc cagtatcaat tggaatagtc atcaagcata gaatacctga aaaagcaagc 900 

ttcttagaga gaataattag acctctgagt tttattttaa tgttcgtagg aatttatttg 960 

actttcacag tgggattagt gttcttaaaa acagataatc tagaggtgat tctgttgggt 1020 

ctcttagttc ctgctttggg tttgctgttt gggtactcct ttgctaaagt ttgtacgctg 1080 

cctcttcctg tttgtaaaac tgttgctatt gaaagtggga tgttaaatag tttcttagct 1140 

cttgccgtta ttcagctgtc ttttccacag tccaaggcca atttagcttc tgtggctcct 1200 

tttacagtag ccatgtgttc tggatgtgaa atgttactga tcattctagt ttacaaggct 1260 

aagaaaagat gtatcttttt cttacaagat aaaaggaaaa gaaatttcct aatctaa 1317 



<210> 5 
<211> 438 
<212> PRT 
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<213> Homo sapiens 
<400> 5 

Met lie Arg Lys Leu Phe lie Val Leu -Leu Leu Leu Leu Val Thr lie 

15 10 15 

Glu Glu Ala Arg Met Ser Ser Leu Ser Phe Leu Asn lie Glu Lys Thr 

20 25 30 

Glu lie Leu Phe Phe Thr Lys Thr Glu Glu Thr lie Leu Val Ser Ser 

35 40 45 

Ser Tyr Glu Asn Lys Arg Pro Asn Ser Ser His Leu Phe Val Lys lie 

50 55 60 

Glu Asp Pro Lys lie Leu Gin Met Val Asn Val Ala Lys Lys He Ser 
65 70 75 80 

Ser Asp Ala Thr Asn Phe Thr He Asn Leu Val Thr Asp Glu Glu Gly 

85 90 95 

Glu Thr Asn Val Thr He Gin Leu Trp Asp Ser Glu Gly Arg Gin Glu 

100 105 110 

Arg Leu He Glu Glu He Lys Asn Val Lys Val Lys Val Leu Lys Gin 

115 120 125 

Lys Asp Ser Leu Leu Gin Ala Pro Met His He Asp Arg Asn He Leu 

130 135 140 

Met Leu He Leu Pro Leu He Leu Leu Asn Lys Cys Ala Phe Gly Cys 
145 150 155 160 

Lys He Glu Leu Gin Leu Phe Gin Thr Val Trp Lys Arg Pro Leu Pro 

165 170 175 

Val He Leu Gly Ala Val Thr Gin Phe Phe Leu Met Pro Phe Cys Gly 

180 185 190 

Phe Leu Leu Ser Gin He Val Ala Leu Pro Glu Ala Gin Ala Phe Gly 

195 200 205 

Val Val Met Thr Cys Thr Cys Pro Gly Gly Gly Gly Gly Tyr Leu Phe 

210 215 220 

Ala Leu Leu Leu Asp Gly Asp Phe Thr Leu Ala He Leu Met Thr Cys 
225 230 235 240 

Thr Ser Thr Leu Leu Ala Leu He Met Met Pro Val Asn Ser Tyr He 

245 250 255 

Tyr Ser Arg He Leu Gly Leu Ser Gly Thr Phe His He Pro Val Ser 

260 265 270 

Lys He Val Ser Thr Leu Leu Phe He Leu Val Pro Val Ser He Gly 

275 280 285 

He Val He Lys His Arg He Pro Glu Lys Ala Ser Phe Leu Glu Arg 

290 295 300 

He He Arg Pro Leu Ser Phe He Leu Met Phe Val Gly He Tyr Leu 
305 310 315 320 

Thr Phe Thr Val Gly Leu Val Phe Leu Lys Thr Asp Asn Leu Glu Val 

325 330 335 

He Leu Leu Gly Leu Leu Val Pro Ala Leu Gly Leu Leu Phe Gly Tyr 

340 345 350 

Ser Phe Ala Lys Val Cys Thr Leu Pro Leu Pro Val Cys Lys Thr Val 

355 360 365 

Ala He Glu Ser Gly Met Leu Asn Ser Phe Leu Ala Leu Ala Val He 

370 375 380 

Gin Leu Ser Phe Pro Gin Ser Lys Ala Asn Leu Ala Ser Val Ala Pro 
385 390 395 400 

Phe Thr Val Ala Met Cys Ser Gly Cys Glu Met Leu Leu He He Leu 

405 410 415 

Val Tyr Lys Ala Lys Lys Arg Cys He Phe Phe Leu Gin Asp Lys Arg 
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420 425 430 

Lys Arg Asn Phe Leu lie 
435 

<210> 6 

<211> 1777 

<212> DNA 

<213> Homo sapiens 



<400> 6 

gcataagcag aaatgaggaa atcaagagga agagattaga tttctgttgt gataaatcga 60 

atctgttaaa tgccatgact ttttaattgt cttaatcaca agttaaaccg gttgtgttgc 120 

tgcttagatg gctatatatt tgtttaaaag tacagcagtc cctcctactg gactttgatc 180 

ctacaaaaac aactgttatc taactcaccc tcagactgtc actggaacac ctgcatgaag 240 

aatgttcttt cattttttaa aaacgatttt gcatatatga tttatttcag ctttcaaaat 300 

gattagaaaa ctttttattg ttctactttt gttgcttgtg actatagaag aagcaaggat 360 

gtcatcgctc agttttctga atatagagaa gactgaaata ctatttttca caaagactga 420 

agaaaccatc cttgtaagtt caagctacga aaataaacgg cctaattcca gccacctctt 480 

tgtgaaaata gaagatccta aaatactaca aatggtgaat gtggccaaga agatctcatc 540 

agatgctaca aactttacca taaatctggt gactgatgaa gaaggagaaa caaatgtgac 600 

tattcaactc tgggattctg aaggtaggca agaaagactc attgaagaaa tcaagaatgt 660 

gaaagtcaaa gtgctcaaac aaaaagacag tctactccag gcaccaatgc atattgatag 720 

aaayatccta atgcttattt taccactaat actattgaat aagtgtgcat ttggttgtaa 780 

gattgaatta cagctgtttc aaacagtatg gaagagacct ttgccagtaa ttcttggggc 840 

agttacacag ttttttctga tgccattttg cgggtttctt ttgtctcaga ttgtggcatt 900 

gcctgaggcg caagcttttg gagttgtaat gacctgcacg tgcccaggag ggggtggggg 960 

ctatctcttt gctctgcttc tagatggaga tttcacattg gccattttga tgacttgcac 1020 

atcaacatta ttggctctga tcatgatgcc tgtcaattct tatatataca gtaggatatt 1080 

agggttgtca ggtacattcc atattcctgt ttctaaaatt gtgtcaacac tccttttcat 1140 

acttgtgcca gtatcaattg gaatagtcat caagcataga atacctgaaa aagcaagctt 1200 

cttagagaga ataattagac ctctgagttt tattttaatg ttcgtaggaa tttatttgac 1260 

tttcacagtg ggattagtgt tcttaaaaac agataatcta gaggtgattc tgttgggtct 1320 

cttagttcct gctttgggtt tgctgtttgg gtactccttt gctaaagttt gtacgctgcc 1380 

tcttcctgtt tgtaaaactg ttgctattga aagtgggatg ttaaatagtt tcttagctct 1440 

tgccgttatt cagctgtctt ttccacagtc caaggccaat ttagcttctg tggctccttt 1500 

tacagtagcc atgtgttctg gatgtgaaat gttactgatc attctagttt acaaggctaa 1560 

gaaaagatgt atctttttct tacaagataa aaggaaaaga aatttcctaa tctaacaatt 1620 

aaagcattac tgaattccta ctctgggtag ggcactgtgg gagagtaaaa gaataactat 1680 

aatgatacct gctcttaagc cacttataat gtaattaaaa atctgacatt ggggggtaga 1740 

gaatacatat atatatatac acacacacac acacaca 1777 



